Abstract 

This paper introduces a new measure-conjugacy invariant for actions of free groups. 
Using this invariant, it is shown that two Bernoulli shifts over a finitely generated free 
group are measurably conjugate if and only if their base measures have the same 
entropy. This answers a question of Ornstein and Weiss. 
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1 Introduction 

This paper is motivated by an old and central problem in measurable dynamics: given two 
dynamical systems, determine whether they are measurably-conjugate, i.e., isomorphic. Let 
us set some notation. 

A dynamical system (or system for short) is a triple (G, X, //) where (X, /i) is a proba- 
bility space and G is a group acting by measure-preserving transformations on (X, //). We will 
also call this a dynamical system over G, a C7-system or an action of G. In this paper, G 
will always be a discrete countable group. Two systems (G, X, fi) and (G, Y, v) are isomor- 
phic (i.e., measurably conjugate) if and only if there exist conull sets X' C X, V C Y 
and a bijective measurable map (p : X' — > V such that _1 is measurable, 4>*n = v and 
<f>(gx) = g<f>(x)Vg e G,x e X'. 

A special class of dynamical systems called Bernoulli systems or Bernoulli shifts has 
played a significant role in the development of the theory as a whole; it was the problem 
of trying to classify them that motivated Kolmogorov to introduce the mean entropy of 
a dynamical system over Z [Ko58, Ko59]. That is, Kolmogorov defined for every system 
(Z, X, /i) a number h(Z, X, fx) called the mean entropy of (Z, X, /i) that quantifies, in some 
sense, how "random" the system is. See [Ka07] for a historical survey. 

Bernoulli shifts also play an important role in this paper, so let us define them. Let (K, k) 
be a standard Borel probability space. For a discrete countable group G, let K G = Ylgtc ^ 
be the set of all functions x : G — > K with the product Borel structure and let k g be the 
product measure on K G . G acts on K G by (gx)(f) = x(g~ l f) for x G K G and g,feG. 
This action is measure-preserving. The system (G, K G , k g ) is the Bernoulli shift over G 
with base (K, k). It is nontrivial if k is not supported on a single point. 
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Before Kolmogorov's seminal work [Ko58, Ko59], it was unknown whether all non- 
trivial Bernoulli shifts over Z were measurably conjugate to each other. He proved that 
h(Z, K z , k 1 ) = H(k) where H(k), the entropy of k is defined as follows. If there exists a 
finite or countably infinite set K' C K such that k(K') = 1 then 

h(k) = - MW) i°g(M{*})) 

keK> 

where we follow the convention 01og(0) = 0. Otherwise, H(k) = +00. Thus two Bernoulli 
shifts over Z with different base measure entropies cannot be measurably conjugate. 

The converse was proven by D. Ornstein in the groundbreaking papers [Or70a, Or70b]. 
That is, he proved that if two Bernoulli shifts (Z, K z , k 1 ), (Z, L z , A z ) are such that H(k) = 
H(X) then they are isomorphic. 

In [Ki75], Kieffer proved a generalization of the Shannon-McMillan theorem to actions 
of a countable amenable group G. In particular, he extended the definition of mean entropy 
from Z-systems to G-systems. This leads to the generalization of Kolmogorov's theorem to 
amenable groups. 

In the landmark paper [OW87], Ornstein and Weiss extended most of the classical entropy 
theory from Z-systems to G-systems where G is any countable amenable group. In particular, 
they proved that if two Bernoulli shifts (G, K G , k g ), (G,L g ,\ g ) over a countably infinite 
amenable group G are such that H(k) = H(X) then they are isomorphic. Thus Bernoulli 
shifts over G are completely classified by base measure entropy. 

Now let us say that a group G is Ornstein if H(k) = H(X) implies (G, K G , k g ) is 
isomorphic to (G, L G , \ G ) whenever (K, k) and (L, A) are standard Borel probability spaces. 
By the above, all countably infinite amenable groups are Ornstein. Stepin proved that any 
countable group that contains an Ornstein subgroup is itself Ornstein [St 75]. It is unknown 
whether every countably infinite group is Ornstein. 

In [OW87], Ornstein and Weiss asked whether all Bernoulli shifts over a nonamenable 
group are isomorphic. The next result shows that the answer is 'no': 

Theorem 1.1. Let G = (s±, . . . , s r ) be the free group of rankr. If(K 1 , k±), (K 2 , k 2 ) are stan- 
dard probability spaces with H(k\) + H(k 2 ) < 00 then (G, K G , k g ) is measurably conjugate 
to (G, K G , k g ) if and only if H(k\) = H(k 2 ). 

The reason Ornstein and Weiss thought the answer might be 'yes' is due to a curious 
example presented in [OW87]. It pertains to a well-known fundamental property of entropy: 
it is nonincreasing under factor maps. Let (G, X, //) and (G, Y, v) be two systems. A map 
: X — > Y is a factor if = v and <f>(gx) = g<f>{x) for a.e. x G X and every g G G. If 
G is amenable then the mean entropy of a factor is less than or equal to the mean entropy 
of the source. This is essentially due to Sinai. So if K n = {1, ... ,71} and K n is the uniform 
probability measure on K n then (G,K g ,k, g ), which has entropy log(2), cannot factor onto 
(G, Kf, n G ), which has entropy log(4). 

The argument above fails if G is nonamenable. Indeed, let G = (a, b) be a rank 2 free 
group. Identify K 2 with the group Z/2Z and K 4 with Z/2Z x Z/2Z. Then 

4>{x){g) := (x(g)+x(ga),x{g)+x(gb)) G K G ,g G G 
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is a factor map from (G,K g ,k g ) onto (G,K g ,k g ). This is Ornstein- Weiss' example. It is 
the main ingredient in the proof of the next theorem, which will appear in a separate paper. 

Theorem 1.2. LetG be any countable group that contains a nonabelian free subgroup. Then 
every nontrivial Bernoulli shift over G factors onto every Bernoulli shift over G. 

To prove theorem II .![ the following invariant is introduced. Let (X, li) be any probability 
space on which G = (si, . . . , s r ), the rank r free group, acts by measure-preserving transfor- 
mations. Let a = {Ax, . . . , A n } be a partition of X into finitely many measurable sets. Let 
B(e, n) C G denote the ball of radius n centered at the identity element with respect to the 
word metric induced by S = {sf 1 , . . . , sf 1 }. The join of two partitions a,(3ofX is defined 
by a V (3 = {A H B | A E a, B G (3). Let 

H(a) := -5>(^)k)g(M^)), 

Aea 

r 

F(a) := (l-2r)F(a) + ^%Vs ! «), 

i=i 

a n := \J gat, 

gdB{e,n) 

f(a) := infF(a"). 

n 

A partition a is generating if the smallest G-invariant a-algebra containing a is the a- 
algebra of all measurable sets (up to sets of measure zero). The main theorem of this paper 
is: 

Theorem 1.3. Let G = (s\, . . . , s r ) . Let (G, X, ll) be a system. If a and ft are finite measur- 
able generating partitions of X then /(a) = f(/3). Hence this number, denoted f(G,X,/i), 
is a measure- conjugacy invariant. 

Theorem 15.21 below implies that if \K\ < oo then f{G, K G , kP) = H(k). This and Stepin's 
theorem proves theorem 11.11 A simple exercise reveals that if r — 1, then f(G,X,Li) = 
h(G, X, li) is Kolmogorov's entropy. 

Here is a brief outline of the paper. In the next section, standard entropy-theory def- 
initions are presented. In §31 an equivalence relation, called combinatorial equivalence, is 
introduced on the space of finite partitions of X, where (X,fi) is a standard probability 
space on which a countable group G acts. We prove that the combinatorial equivalence class 
of a finite generating partition is dense in the space of all generating partitions. In §H we 
introduce an operation on partitions called splitting and show that any two combinatorially 
equivalent partitions have a common splitting. This culminates in a condition sufficient for 
a function from the space of partitions to R to induce a measure-conjugacy invariant. In §3 
this condition is shown to hold for the function F defined above. This proves theorem 11.31 
Then we prove theorem 15.21 (that f(G,K G ,K G ) = H(k) if \K\ < oo) and conclude theorem 

o 



4 



Acknowledgments: I am grateful to Russell Lyons for suggesting this problem, for 
encouragements and for many helpful conversations. I would like to thank Dan Rudolph for 
pointing out errors in a preliminary version. I would like to thank Benjy Weiss for many 
comments that helped me to greatly improve the exposition of this paper and simplify several 
arguments. 

2 Some Standard Definitions 

For the rest of this section, fix a standard probability space (X,fj). 

Definition 1. A partition a = {Ai, . . . , A n } is a pairwise disjoint collection of measurable 
subsets Ai of X such that Uf =1 Ai = X. The sets Ai are called the partition elements of 
a. Alternatively, they are called the atoms of a. Unless stated otherwise, all partitions in 
this paper are finite (i.e., n < oo). 

If a and (3 are partitions of X then we write a — f3 a.e. if for all A e a there exists B e (3 
with fi(AAB) = 0. Let V denote the set of all a.e.-equivalence classes of finite partitions of 
X. By a standard abuse of notation, we will refer to elements of V as partitions. 



Definition 2. If a, (3 e V then the join of a and (3 is the partition a\/j3 = {AnB\Ae 
a, B E 13}. 



Definition 3. Let T be a a-algebra contained in the a-algebra of all measurable subsets of 
X. Given a partition a, define the conditional information function I{a\!F) : X — > R 



where A x is the atom of a containing x. Here n(A x \J-') : X — > K. is the conditional expectation 
of Xa X ) the characteristic function of A x , with respect to the a-algebra T. 
The conditional entropy of a with respect to T is defined by 



If (3 is a partition then, by abuse of notation, we can identify (3 with the a-algebra equal 
to the set of all unions of partition elements of f3. Through this identification, I(pt\f3) and 
H{a\(3) are well-defined. Let 1(a) = I(a\{$,X}) and H(a) = H(a\{&,X}). 

Lemma 2.1. For any two partitions a, (3 and for any two a-algebras T\,Ti with T\ C T2, 



by 



7(a|^(a;) = -log [}i(A x \T)(x 



)) 




%V (3) = H(a) + H(P\a), 



H{a\T 2 ) < H(a\T 1 ). 



Proof. This is well-known. For example, see [G103, Proposition 14.16, page 255]. 



□ 
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Definition 4 (Rokhlin distance). Define d : V X V — > M by 

d(a,/3) = #H/3) + ff(/3|a) = 2H(a V P) - H(a) - H((3). 

By [Pa69, theorem 5.22, page 62] this defines a distance function on V. If G is a group acting 
by measure-preserving transformations on (X, fi) then the action of G on V is isometric. I.e., 
if g £ G, a, f3 £ "P then d(pa, = d(a, From now on, we consider P with the topology 
induced by d(-, •). 

Definition 5. Let G be a group acting on (X, //). Let a be a partition of X. Let S a be the 
smallest G-invariant cr-algebra containing a. Then a is generating if for any measurable 
set A C X there exists a set A' £ S a such that fi(AAA') = 0. Let V gen C P denote the set 
of all generating partitions. 

3 Combinatorially Equivalent Partitions 

For this section, fix a countable group G and an action of G on a standard probability space 
(X, /j,) by measure-preserving transformations. 

Definition 6. Given a £ V and F C G finite, let a F = V/gF f a - 

Definition 7. If a, [3 £ P are such that for all A £ a there exists B E (3 with — 5) = 
then we say that a refines (3 and denote it by a > (3. Equivalently, j3 is a coarsening of a. 

Definition 8. Let a, (3 E V. We say that a is combinatorially equivalent to (3 if there 
exist finite sets L,M C G such that a < [3 L and /3 < a M . Let P e5 (a) C V denote the set of 
partitions combinatorially equivalent to a 

The goal of this section is to prove theorem 13.31 below: if a is a generating partition then 
V eq (ot) is dense in the subspace of all generating partitions. 

Lemma 3.1. Let a be a generating partition and (3 = {E>i, . . . , B m } £ V. Let e > 0. Then 
there exists a partition [3' = {B[, . . . , B' m } and a finite set L C G such that a L > (3' and for 
alli = \...m, ^(BiABfi < e. 

Proof. Since a is generating, there exists a finite set L C G such that for every i £ {1, . . . , m}, 
there is a set B", equal to a finite union of atoms of a L , such that (/.(B^AB") < —. For 
i — 1 ... m — 1, let 

^ :=X-\jB , i = B» n u\jB?nB;. 

Observe that for all i — 1 ... m, 

Bt-U B'JABj C B[ c Bi U [J Sj'ABj. 
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Thus 

fiffiABi) < m(-) =e. 
By construction, f3' = {B[, . . . , B' m } < a L . 



□ 



Lemma 3.2. Let a = {Ai, . . . ,A n } £ V and f3 £ V gen - Let e > 0. Then there exists 
a finite set M C G such that for all finite L C G with M C L, the partition elements 
{Bf, . . . , B^ L } of f3 L can be ordered so that there exists an r £ {1, . . . , itil} and a function 
f : {1, 2, . . . r} — > {1, 2, . . . , n} so that for all i £ {1, . . . , r} ; 

m) - 

and 



i>r 



Proof. Let 5 > be such that 5 < . By the previous lemma, there exists a partition 

a' = {A[,..., A' n } £ V and a finite set M C G such that a' < (3 M and /i(^A^) < 5 for all 
i Let L be any finite subset of G with M C L. 

Let /3 L = , . . - ,B^ L } and let / : {1, . . . ,m L } — > {1, . . . , n} be the function /(i) = j 
if \x(B\ — A'-) = 0. This is well-defined since f3 L refines a'. 

After reordering the partition elements of f3 L = {B± , . . . , B^ L } if necessary, we may 
assume that there is an r £ {0, . . . , m^} such that, if r > then for all i < r, 

v(BtnA m ) r 



and if i > r then 

So if % > r then 
So 



<1-V5. 



KBfnA m ) 



KBt) 

/ i(^ L nA /w )<(i-v / 5)M^ L ). 

H(B±) = ^-A m )+^(BfnA f(i) ] 



< ^-A m ) + (1-V5)^). 

Solve for n(Bf') to obtain 
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Since the atoms Bf are pairwise disjoint, it follows that 




Since ^{Bf — A'f,*) = 0, it must be that Bf — Af^ C A'f,* — Af^, up to a set of measure 
zero. So, 




< n\/5 < e. 



□ 

Theorem 3.3. If a is a generating partition then 

gen C T~*eq\Pt) • 

I.e., the subspace of partitions combinatorially equivalent to a is dense in the space of all 
generating partitions. 

Proof. Let a = {Ax, . . . , A n } and (3 = {£>i, . . . , B m } e V gen . Without loss of generality, 
we assume that //(Aj) > for all % = 1 . . . n. Let e > 0. By the previous lemma, there 
exists a finite set L C G such that the atoms of f3 L = {Bf , . . . , B^ } can be ordered so that 
there exists an r £ {1, . . . , m^} and a function / : {1, 2, . . . r} — > {1, 2, . . . , n} so that for all 
i G {l,...,r}, 

and 

/i(LK) <e - w 

By choosing e small enough (if necessary) we may assume that / is onto (for example, by 
choosing e to be smaller than |/i(A,) over all j = 1 . . . n). 

By definition of (3 L , tul < m' L L If necessary, we may assume that tul = ^' L ' after 
modifying f3 L by adding to it several copies of the empty set. That is, for some i, it may 
occur that B\ = 0. 

Let 5 > be such that 5 < e. By lemma 13.11 there exists a partition 7 = {C\, . . . , C m } 
and a finite set M C G such that 7 < a M and /i(CiABi) < 5 for all i. By choosing 5 small 
enough we may assume the following. Let 7 L = {Cf , . . . , C^ L }. Then, after reordering the 
atoms of 7 L if necessary, 

mi 

»{{jCf-Bf)<e. (2) 
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Let 

C[ = {x G Ci | if x G Cf for some j then x G ^/(j)} 

rn L 

Let Cjj = Q n A,- - CJ. Let 

71 = {C; | i = 1 . . . m} U {C,,,- \l<i,j < m}. 

Note that 71 < (a M ) L = a LM where LM = {lm \ I G L,m G M}. We claim that 71 
is combinatorially equivalent to a. Let Si be the smallest G-invariant collection of subsets 
of X that is closed under finite intersections and unions and contains the atoms of 71. It 
suffices to show that every atom of a is in Si. Observe that, for each i, Ci — C[ U Uj=i 
Hence, Ci G S^ Therefore the atoms of 7 L are also in S^ Since / is onto, the definition of 
C'i implies 

C l nA p = u{C>ncf \ f(j)= P }. 

So C[ n A p is in Si for all i,p. Now d H A p = C iyP U (C- n A p ). So Q D A p G Si for all i,p. 
Since 

m 

A p = Ci n A p , 

i=l 

Ap G Si. Since p is arbitrary, this proves the claim. Thus 71 G V eq {a). 
We claim that /i(C-ACj) < 3e for all i. By definition, 

m L 

C'iACi = Ci-C'i(z{JCf-A m . 
j'=i 

For each j, 

C^-A^c^-^U^-A^). 

Thus, 

mi, r 

CjACi C [J(C? - Bf) U [J(^ - U \J(Bf - A m ). (3) 

j=l j=\ j> r 

If j < r, then by definition of r, 

v(Bf n A /0) ) 

This implies 

M^-4o))<^)- 
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Thus 



KU^-4a))<E e ^ L )<e. (4) 

j=i i 

Equations El El 0] and [T] imply the claim. 

Since 5 < e and //(CjASj) < 5 for all i, the above claim implies that fi(C-ABi) < 4e for all 
Thus we have shown that for every e > 0, there exists a partition 71 = {C[, . . . , C' m , . . .}, 
combinatorially equivalent to a, containing at most m + m 2 partition elements and such that 
/i(C-A-Bj) < 4e for i = 1 . . . m. This implies that ft is in the closure of V eq (a). Since ft is 
arbitrary this implies the theorem. □ 



4 Splittings 

In this section, G can be any finitely generated group with finite symmetric generating 
set S. Let (X, //) be a standard probability space on which G acts by measure-preserving 
t r ansf or mat ions . 

Definition 9. Let a be a partition. A simple splitting of a is a partition a of the form 
a = a V sft where s G S and ft is a coarsening of a. 

A splitting of a is any partition o that can be obtained from a by a sequence of simple 
splittings. In other words, there exist partitions ao,«i, • • • , OL m such that «o = ®, ot m = a 
and a i+ i is a simple splitting of a, for all 1 < i < m. 

If a is a splitting of a then a and a are combinatorially equivalent. The splitting concept 
originated from Williams' work [Wi73] in symbolic dynamics. 

Definition 10. The Cayley graph T of (G, S) is defined as follows. The vertex set of T is 
G. For every s e 5 and every g G G there is a directed edge from g to gs labeled s. There 
are no other edges. 

The induced subgraph of a subset FcGis the largest subgraph of V with vertex set 
F. A subset F C G is connected if its induced subgraph in T is connected. 

Lemma 4.1. If a, ft G V, a refines ft and F G G is finite, connected and contains the 
identity element e then 

«V \/ fft 

is a splitting of a. 

Proof. We prove this by induction on \F\. If \F\ = 1 then F = {e} and the statement is 
trivial. Let f G F — {e} be such that F\ — F — {/ } is connected. To see that such an f 
exists, choose a spanning tree for the induced subgraph of F. Let fo be any leaf of this tree 
that is not equal to e. 
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By induction, a,\ := a V V/gf" 1 f0 ls a splitting of ol. Since F is connected, there exists 
an element f\ G F\ and an element S\ E S such that f\S\ = Jq. Since /i G -F\, «i refines 

(/r 1 ^). Thus 

is a splitting of a. □ 

Proposition 4.2. Lei a,/3 6e two combinatorially equivalent generating partitions. Then 
there is an n > such that 

a n = \J ga 

g£B{e,n) 

is a splitting of 0. Here B(e,n) is the ball of radius n centered at the identity element in G 
with respect to the word metric induced by S. Of course, a n is also a splitting of a. 

This proposition is a variation of a result that is well-known in the case G = Z in the 
context of subshifts of finite-type. For example, see [LM95, theorem 7.1.2, page 218]. It was 
first proven in [Wi73]. 

Proof. Let L, M C G be finite sets such that a < L and < a M . Let l,m G N be such 
that L C B(e,l) and M C B(e,m). So a < [3 l and (3 < a m . Since balls are connected and 
a < j3 l , the previous lemma implies f3 l V a m+l is a splitting of /3 , and therefore, is a splitting 
of 13. But f3 l Va m+I = ((3Va m ) 1 = a m+l . □ 

Theorem 4.3. Let $ : V —>■ R be any continuous function. Suppose that $ is monotone 
decreasing under splittings; i.e., if a is a splitting of a then $(<x) < $(a). Define <p '■ V —> R 

0(a) = lim = inf $(a n ). 

Then, for any two finite generating partitions ati and cti, 0(ai) = 4 > { a 2)- we ma y 
define <p(G,X,fi) = <f)(a) for any finite generating partition a. The number <p(G,X,fi) is a 
measure- conjugacy invariant. 

Proof. Let a and f3 be two combinatorially equivalent finite partitions. We claim that <ft(a) = 
<j>(0). To see this, suppose, for a contradiction, that 0(a) < (f)(0). Then there exists an n > 
such that $(a n ) < (f)(0). But by the previous proposition, there is an m > such that f3 m 
is a splitting of a n which implies $(a n ) > $(/9 m ) > (f)(0), a contradiction. So 4>(a) = (f)(0). 

For n > and a G P, let $ n (a) = $(a n ). Since $ is continuous and the map a i— > a™ 
is also continuous, it follows that $ n is continuous. Since 0(a) = inf n $ n (a), it follows that 
is upper semi-continuous, i.e., if {0 n } is a sequence of partitions converging to a then 
lim sup n (f)(0 n ) < 0(a). 

Now let a, /? be two finite generating partitions. By theorem 13.31 there exist finite 
partitions {0 n }'^ = i combinatorially equivalent to such that {/3 n }^i converges to a. So 
(f)(0) = limsup^ 4>(0 n ) < 0(a). Similarly, 0(a) < (f)(0). So 0(a) = 0(/3). 

□ 
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5 The /-invariant 

In this section, G = (s%, . . . ,s r ). Let (X, /i) be a standard probability space on which G 
acts by measure-preserving transformations and let S = {sf 1 , . . . , s^ 1 }. Note \S\ = 2r. Let 
F : V —>■ M. be defined as in the introduction. 

Proposition 5.1. Let a EV. If a is a splitting of a then F(a) < F(a). 

Proof. By induction, it suffices to prove the proposition in the special case in which a is a 
simple splitting of a. So let a = aVt(3 for some i £ S 1 and coarsening [3 of a. For any s G S, 

H(ay sa) = H(aV sa) + H(aV sa\aV sa) 

= H(a V sa) + H(sa\a V sa) + H(a\a V sa V sa) 
< H(aV sa) + H(a\aV s^a) + H(a\aV sa). 

The last inequality occurs because fi is G-invariant, so H(sa\a V sa) = H(a\a V s~ 1 a). 
Since H(a) = H(a) + H(a\a), the above implies 

r 

F(a) < (l-2r)(H(a) + H(a\a)) +^2H(aV sa) + H(a\aV s~ l a) + H(a\aV sa) 

i=l 

= F(a) + {l-2r)H(a\a) + ^H(a\aV sa). 

ses 

Since a < a\/ ta, H(a\a V ta) = 0. Hence 

F(a)-F(a) < (1 - 2r)H(a\a) + ^ H(a\aV sa) 

ses-{t} 

= [H{a\a\J sa) - H{a\a)\ < 0. 

ses-{t} 

□ 

Theorem 11.31 now follows from the proposition above and theorem 14.31 

Definition 11. Let K be a finite set and k a probability measure on K. Let K G be the 
product space with the product measure k g . The system (G, K G , k g ) is called the Bernoulli 
shift over G with base measure k. 

Let Ak = {x G K G | x(e) = k} where e denotes the identity element in G. Then a = 
{Ak | k G K} is the Bernoulli partition associated to K. It is generating and H(k) = H(a), 
by definition. 

Theorem 5.2. Let G — (si, . . . , s r ) be the free group of rank r. Let K be a finite set and k 
a probability measure on K . Then 

f(G,K G ,K G ) = H(K). 
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Proof. Let a be the Bernoulli partition associated to K. Let g x , . . . , g n be n distinct elements 
of G. It follows from the Bernoulli condition that the collection {gi(x}™ =l of partitions is 
independent. This means that if j : {1, . . . , n} — > K is any function then 



n n 



i=l i=l 

It is well-known that this implies 

n n 

h(\J gio^j = S ^2H(g i a) = nH(a). 

i=l i=l 

See, for example, [G103, prop. 14.19, page 257]. So for any k > 1, 

ses 

= (l E l 5 ( e ' fc ) U - - l)|S(e, fc)|ff(a). 

ses 

Suppose r > 1. Then, since G = (si, . . . , s r ) is free, it is a short exercise to compute: 

(|^| -1)^-1 



|B(e,*)| = l + |5|- 



\B(e,k)UB(s,k)\ = 2- 



|5|-2 ' 
151-2 



for all s G S. Thus, 

iV) = g(a) (\S\ ~ 1 - - 1) - (\S\ - l)l^l^ 1)fc " 1 



H(a). 



If r = 1 then \B(e,k)\ = 2k + 1 and \B(e, k) UB(s,k)\ = 2k + 2. So it follows in a similar 
way that F(a k ) = H{a). So f(G, X, //) = linn^ F(a k ) = H(a) = H(k). □ 

Proof of theorem \l.l[ According to Stepin's theorem [St 75], if (Kx, Ki), (K 2 , k 2 ) are standard 
Borel probability spaces with H(k\) = H(k 2 ) then (G, iff , nf ) is measurably conjugate to 
(G,K 2 G ,«f). 

Now suppose (Ki,Kx), (K 2 , k 2 ) are Borel probability spaces such that (G, K^, ) is 
measurably conjugate to (G, K 2 , K 2 ). Let Ai), (L 2 , A 2 ) be probability spaces with \Li\ + 
\L 2 \ < 00 and H(X i ) = Hfa) for i — 1,2. By Stepin's theorem, (G,Lf,Xf) is measurably 
conjugate to (G, Kf , nf). By the above theorem, f(G,Lf,Xf) = H(Xi). Since / is a 
measure-conjugacy invariant, H(ki) = H(k 2 ). □ 
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